MARKOV DEGREES OF HIERARCHICAL MODELS DETERMINED 
BY BETTI NUMBERS OF STANLEY-REISNER IDEALS 



SONJA PETROVIC AND ERIK STOKES 

Abstract. There are two seemingly unrelated objects associated to a simplicial com- 
plex: a hierarchical model and a Stanley-Reisner ring. A hierarchical model gives rise 
to a toric ideal, its generators providing a Markov basis for the model — a relationship 
that is a staple of algebraic statistics. In this note, we explore the connection between 
minimal generators of this toric ideal and syzygics of the Stanley-Reisner ideal. We 
propose a precise conjecture, supported by extensive computations, and we prove it in 
several cases, most notably for vertex-decomposable and decomposable complexes. 

As a first result establishing a relationship between the Stanley-Reisner ring and the 
ideal of the model, we hope this inspires a further study of the effect of the algebraic and 
geometric invariants of the classical combinatorial object on the underlying algebraic 
statistical model. 



1. Introduction 

Among the growing aspects of applied algebraic geometry, algebraic statistics takes a 
prominent role. It can generally be defined as the use of algebraic and geometric methods 
for the study of (algebraic) parametric statistical models. There is a natural relationship 
to algebraic geometry which has generated studies of families of models from this point 
of view. Roughly, each probability gives rise to a polynomial in the model parameters, 
inducing a polynomial map. The real positive part of (the closure of) the image of this 
map is the statistical model of interest, and it is also an algebraic variety. The special 
case of interest is when the images are monomial. The map is then encoded by a matrix 
A, and its image is a toric variety. Any generating set of the defining ideal of this 
variety is a Markov basis for the model; this is the Fundamental Theorem of Markov 
bases that appeared in the breakthrough paper [7]. A Markov basis is a set of moves 
guaranteed to connect all points in any of the fibers, that is, sets of points with the same 
sufficient statistics, thus enabling us to take random walks on the fibers. Markov bases 
provide an alternative, non-asymptitic approach to performing goodness-of-fit tests and 
model selection. Their use has increased in recent years, especially in the models where 
standard tools do not scale well. From a large and growing literature in the area, let us 
single out [TT], [2H] and [27], accessible to algebraists. 

Hierarchical models (defined in [13], see also [20]) are algebraic statistical models such 
that relationships between m random variables are described by a simplicial complex A 
on m vertices. We defer the formal definition and detailed discussion until Section [31 We 
are interested in the problem of understanding how properties of the simplicial complex 
determine the properties of the ideal /m(A) of the hierarchical model, where M(A) plays 
the role of A in the definition of the toric ideal of the model. For example, it would be 
of interest to bound the Markov width of the model, defined to be the largest degree of 
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a minimal generator of Im(A)- There are several interesting results in this direction, for 
example [20], [6], and related recent work in [2], [19]. In addition, if A is a reducible 
complex, then Im(A) is a toric fiber product, so it is possible to lift the generators 
inductively (for the most general construction, see [50]). 

The Stanley-Reisner ideal is a well-studied ideal associated to a simplicial complex A 
on m vertices; it is the squarefree monomial ideal /a C K[xi, . . . , x^] generated by the 
minimal non-faces of A. Betti numbers of I a encode the ranks of the syzygy modules in a 
minimal free resolution of its coordinate ring R/Ia- For formal definitions, see Section[2j 
The main results in this paper, contained in Section |H provide a first link between Betti 
numbers of I a and the degrees of minimal generators of /m(A)- Namely, we prove 
the following conjecture for several famihes of complexes (e.g., vertex-decomposable, 
decomposable, Gorenstein of small dimension): 

Conjecture 1.1. Let A be any simplicial complex. Suppose that the Betti diagram 
of the minimal free resolution of R/Ia has a non-zero entry in the j-th row (that is 
Pi,i+j{R/lA) 7^ for some i). Then the toric ideal of the hierarchical model, Im, has a 
minimal generator with degree 2^ . 

In addition, the Conjecture was confirmed computationally for many complexes, using 
4ti2 [T], Macaulay2 [TB] and Sage [25]. The claim is a generalization of the observations 
from Section 14. 1^ which come about naturally by asking whether I a having a degree-j 
minimal generator implies that Im has a degree 2^~^ minimal generator (see 14.17]) . Note 
that the j-th row in the Betti diagram of any homogeneous ideal is non-zero whenever 
the ideal has a degree j + 1 minimal generator. Thus, we ask the more general question: 
whether syzygies of the Stanley-Reisner ideal give any further information about the 
toric ideal of the hierarchical model. 

The Betti diagram of Ia can vary with the field for general A: for example, the Alexan- 
der dual of any triangulation of the projective plane changes regularity when the field 
has characteristic 2. (Recall that the regularity is the largest j so that Pi,i+j[R/lA) 7^ 0.) 
Therefore, for those complexes for which Conjecture II. II is true, we can choose a "worst- 
case" field, where the regularity is largest, thus giving us most information about the 
Markov degrees of the hierarchical model. 

Example 1.2. Consider the binary model of a 4-cycle A with facets {12}, {23}, {34}, {14} 
The Stanley-Reisner ideal {xiX^,X2X/i) has the following Betti diagram: 



Since the Conjecture is true for A (see Theorem 14.81) . the toric ideal Im must have 
generators in degrees 2^ = 2 and 2^ = 4. In fact, in this case, Im happens to be 
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generated solely in these degrees: 

= (PlOllPlllO ~ PlOloPllll! PlOOlPllOO ~ PlOOOPllOl! 

PoiiiPiioi — P0101P11115 PoiioPiioo ~ P0100P11107 PooiiPoiio ~ PooioPoiii) 

PooiiPiooi — P0001P10115 PoooiPoioo — J'ooooPoioi 5 PooioJ'iooo — PooooPioio, 

PoiooPoiiiPiooiPiolo ~ PoioiJ'oiioPioooJ'ioii) J'ooioPoioiPioiiPiloo ~ PooiiPoiooJ'ioioPiioi) 

PoooiPoiioPioioPiioi — PooioPoioiPiooiPiiiO) PoooiPoiiiPioioPiloo ~ PooiiPoioiPioooPiiiO; 

PooooPooiiPiioiPiiio ~ PoooiPooioPiiooPiiii) PooooPoiiiPiooiPiiio ~ PoooiPoiioPioooPiiii) 

PooooPoiioPioiiPiioi — PooioPoiooPiooiPiiii) PooooPoiiiPioiiPiloo ~ PooiiPoiooPioooPiiii)- 

We can settle the conjecture for several special classes of complexes, most notably, for 
vertex-decomposable (Theorem 14. 8p and decomposable (Proposition 15.51] . In addition, 
in Section 14.31 we outline a way to construct some of the predicted Markov moves from 
our main results. 

The family of decomposable complexes arises in the theory of graphical models. For a 
detailed study, see for example [9] and [8] , where they appear under the name decompos- 
able graphical models. They correspond to the hierarchical models of clique complexes 
of chordal graphs (see Theorem 3.3.3. in [H]). In particular, it is known that if the 
complex is decomposable, then the toric ideal is generated by quadrics (and even has a 
quadratic Grobner basis, see [T5]). 

Clique complexes of chordal graphs make an important appearance in commutative 
algebra as well: they are known to have linear resolutions by a result of Froberg [H] 
(see also [10]). We can recover this result via algebraic statistics! Namely, if a complex 
satisfies our conjecture, then the following is clear: 

Corollary 1.3. // the ideal of the hierarchical model satisfying \1.1\ is generated in a 
single degree, then the Stanley- Reisner ring of the complex has a linear resolution over 
any field. 

For more details on decomposable models, see Section O A geometric interpretation 
of the results is discussed briefly in Section O Our computations are summarized in 
Section [71 

This new connection begs for interpretation of other classical numerical invariants of 
the coordinate ring. In terms of the model, these questions are still unexplored. Our 
hope is that they can provide deeper understanding of the geometry of the models in 
relation to the properties of the Stanley-Reisner rings. 

2. Syzygies of Stanley-Reisner ideals 

In this section we briefly recall some basic definitions and notation regarding Stanley- 
Reisner ideals and their resolutions. 

Let K be any field, R := K[xi, . . . ,Xn] and define Xg- := riiecr-^* cr [n] = 
{!,... ,n}. Then the Stanley-Reisner ideal of the simplicial complex A is 

I A := (xa I (7 ^ A) C R. 

The elements of A are called faces, with the dimension of a face F defined as dim F = 
\F\ — 1. Faces with dimension and 1 are called vertices and edges, respectively. The 
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dimension of A is dim A := maxjdimF | F G A}. The inclusion-maximal faces of A 
are called facets. If all the facets have the same dimension, we say that A is pure. If A 
has some vertex v that is contained in every facet, then we say that A is a cone with 
apex V. 

There are three basic constructions for a simplicial complex: restrictions, deletions, 
and links. For a C [n], the restriction of A to a is defined as 

A|, := {F G A I F C a}. 

The deletion of v from A is 

A_^ := {F G A I ^ F}. 

The link of a complex with respect to F C [n] is defined to be 

linkA(F) := {G G A I G n F = 0, F U G G A}. 

Given any homogeneous ideal, I R, the minimal free resolution of R/I is the exact 
sequence of free i?-modules 

^ Fp ^ > F2^ Fi^ R/I ^0. 

Given a G Z, define R{a) to be the ring R with the degrees shifted by a, that is, a degree 
i element of R{a) is a degree i + a element of R. Then each term in the above resolution 
can be written 

F. = ^R{-jr^^. 

The numbers Pij{R/I) are called the graded Betti numbers and (3i{R/I) := Pij{R/I) 
the total Betti numbers. If / and R are understood from the context, we simply write 
for the graded Betti numbers. Typically, we summarize this numerical data in a 
standard Macaulay2 [16] Betti diagram, a table whose (i, j)-th entry is 

i 

total: ■ ■ ■ /3j 
3'- ■ ■ ■ 

This is the notation used in Example II. 21 where, in the interest of readability, we use ■ in 
place of 0). For additional details on simplicial complexes and minimal free resolutions 
see [25] or [1]. 

The fundamental result describing the Betti numbers of Stanley-Reisner ideals is 
Hochster's formula, which relates the Betti numbers to the simplicial cohomology of 
the complex. Definitions and details on simplicial cohomology may be found in any text 
on algebraic topology, for example [T7] . 

Theorem 2.1 (Hochster's Formula). The graded Betti numbers of R/I^ are given by 

kl=i 

Proof. This result is Corollary 5.12 in [25] and we refer the reader there for a proof. □ 
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3. Markov bases of hierarchical models 

Hierarchical models ([13], [20]) are algebraic statistical models such that relationships 
between m random variables are described by a simplicial complex on m vertices. In 
essence, the facets of the complex determine which margins of the corresponding m-way 
contingency table serve as minimal sufficient statistics for the model. More precisely. 

Definition 3.1. 

(a) Given a. di x ■ ■ ■ x table, T G (S'j=i ^'^^ ^ — ['A define the F -margin of 
T to be 

(b) Let A be a simplicial complex on [n], facets Fi, . . . ,Fs and d = [di, . . . , dn)- Then 
we define a linear map 0a = (pA,d- 'S>^=i ~^ ®i=ii'S)jeFi ^'^^) sending a table T 
to {Tp^, . . . , TpJ. We define M(A, d) to be the matrix representing this map. 

Example 3.2. The matrix for the complex with facets {1} and {2} is 

■ 1 1 " 

11 
10 10 
10 1 

The map : — t- © R^, restricted to the probability simplex, sends a 2 x 2 con- 
tingency table [Tij] to the 1-dimensional margins obtained by summing over each index: 
[Tij] i-T- ^y]i=i,2) Ei ^ii]i=i,2)- These are the row and column sums of the table. 

Hierarchical models are a subclass of log-linear models, sets Ma of all probability 
distributions whose logarithms are in the linear span of the rows of some matrix A. The 
fiber of A at u, denoted by J^{u), is a set of all points v such that Av = Au. A Markov 
basis B C ker A for the model Ma is a finite set of tables, called moves, that connects 
every fiber J^{u) in the following sense: if Ti,T2 G J^{u) then Ti = T2 -|- ^ m j , for some 
collection of moves {rrii} C B. This set of moves allows one to perform random walks 
on a fiber and can be used for testing goodness of fit of the model. 

The matrix A of the log-linear model JUa determines a toric ideal 

:= (^x"" - x'' : u - V e ker A) 

in the polynomial ring S := K[xi, . . . ,X]sf], where is the number of entries in the table. 
A standard reference on toric ideals is [2U] . 

Example 3.3. Extending the map from Example 13.21 to an arbitrary 2x2 table: 

: K[xii, X12, X21, X22\ K[yi, 1/2, ^i, ^2] 

where K is a. field, we see that the image of restricted to the probability simplex is 
exactly the image of 0. (Namely, Xij represents the (i, j)-entry in the contingency table 
whose entries are probabilities, thus are nonnegative and with overall sum 1.) The toric 
ideal ker is generated by X11X22 — a^i22;2i; the 2-minor of the generic 2x2 matrix. The 
map parametrizes the Segre embedding of x P-"^ — P^. 
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A starting point of the field of algebraic statistics is the realization that generators of 
the ideal I a form a Markov basis for the model A4.a'i this is the Fundamental Theorem 
of Markov Bases due to Diaconis and Sturmfels ([7], see also Theorem 1.3.6. in 

In [20], Ho§ten and SuUivant observe that if M(A, d) is the design matrix of a hi- 
erarchical model for the complex A with variable Xi taking values in {1, . . . , rfj}, then 
M(A, d) is of the following form: 

"AO ... 0" 
A ... 



M(A,ci) 



... A 
B B B ... B 



They also note that A and B are matrices corresponding to the link of vertex 1, linkA(l) 
(recall Section [2]), and to the complex generated by the facets that do not contain 1, 
respectively. Note that we can relabel the vertices so that any vertex is vertex 1. 

From here on we adopt the following notation: A is a simplicial complex on n vertices, 
d = {di, . . . ,dn) a sequence of non-negative integers and M = M{A,d). In addition, 
A and B are the matrices from the above decomposition of M. The toric ideal of the 
model is denoted by Im and hves in the polynomial ring S. 

We would like to relate B to the more usual deletion complex, A^^,; recall Section [2l 
To that end, let A^_^) be the complex consisting of all the facets not containing v along 
with all their subsets. Under some assumptions, these two complexes agree. 



Lemma 3.4. //dim A„^ = dim A and both complexes are pure, then A„^ = A 



Proof. The containment A(_t,) C A_^ is clear, since any facet not containing v is in A.^,. 
For the opposite inclusion, let F G A_„ be a facet. Then either F or FU{v} is a facet of 
A. In the first case, F G A|;„^). The purity and the assumption that dimA_t, = dim A 
eliminates the second case. □ 

The assumptions in Lemma [3.41 are necessary, as the following example shows. 

Example 3.5. Consider the complex A with facets {1,2,3}, {1,4} and {2,4}. Then 
A_i has facets {2,3} and {2,4}. However, {2,4} is the only facet of A not containing 
1, so it is the only facet of A(„i). Note that 2 = dim A > dim A^_i) = 1. 

When A is pure, dimA_^, < dim A if and only if A is a cone with apex v. We can 
assume that v = 1, relabeling vertices if necessary. We adopt the convention that if 
every facet of A contains 1, then B = [ ] is the empty matrix and keri? = 0. Thus, if 
A is a cone with apex 1, 

ker M = ker A © ■ ■ ■ © ker A . 

^ V ' 

n summands 

Indeed, since 1 is the apex of the cone, there are no facets not containing 1, thus B is the 
empty matrix. We can now write M in block form, with copies of A down the diagonal. 
The statement above is now a straightforward observation. 

It is interesting to note that in some special cases, the matrix of the hierarchical 
model has a structure that makes an appearance in integer programming. Namely, if 
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dim A = d and the only facet not containing 1 has dimension d, then B = I, the identity 
matrix, and M becomes a higher Lawrence lifting of A, denoted by Adj^{A). Then the 
Graver basis of Im coincides with any minimal Markov basis and any reduced Grobner 
basis [ini Theorem 7.1]. 

To close this section we consider two more examples. We say that a model is binary 
if each node can take exactly two states: di = ■ ■ ■ = (i„ = 2. 

Example 3.6 (Boundary of a simplex). Let F be the boundary of an ra-simplex and con- 
sider the binary hierarchical model determined by F. The toric ideal Im '■= -^M(r,(2,...,2)) 
is a principal ideal generated by the degree 2"~^ binomial mi — m2, where mi is the 
product of the variables with even index and m2 those with odd index. 

For convenience of the reader, we provide a short proof. Notice that the deletion of 
any vertex of F is a (n — l)-simplex, and the link of the vertex is the boundary of the 
deletion. Since the model is binary, the matrix has the form: 



where I is the 2^~^ identity matrix. For n = 2, the result can be computed by hand. 
By induction on n, I a is generated by the binomial associated to the vector f , where 
Vi = (—1)*"^. Since ker/ is trivial, the only elements in kerM are of the form (a, —a) 
for a G ker A. Then [v, —v) gives the only minimal generator of Im- 

If A has a zero-dimensional connected component, then relabeling vertices, if neces- 
sary, ensures that linkA(l) is trivial. Thus, information about the trivial complex can 
then be used to extract information about zero-dimensional complexes. 

Example 3.7 (The trivial complex). Let A = {0} C 2'"]. Then a Graver basis of Im is 
given by the collection of all degree 1 binomials in S. A minimal Markov basis is given 
by the binomials of the form xi — Xi for 1 < i < N. This becomes a simple observation, 
once we see that the complex A has a single facet, 0, implying that M = [1 1 ■ ■ ■ 1]. 
The details are left as an exercise. 



We have seen that a simplicial complex defines a monomial ideal and a toric ideal. 
In this section, we propose a relationship between some of their numerical invariants. 
First, we study the relationship between initial degrees of the two ideals. Then, we prove 
Conjecture II. II for vertex-decomposable complexes. In particular, the proof in this case 
allows us to show how to construct some of the moves explicitly. Two more special cases 
follow, and we conclude the section by showing that the converse of the Conjecture does 
not hold. 

4.1. Initial degrees. The initial degree of a homogeneous ideal /, init(/), is the smallest 
degree of a minimal generator of /. 

It is generally convenient to be able to assume that /o(A) = n, that is, that A contains 
every vertex. On the level of Stanley- Reisner ideals, /o(A) < if and only if Xj G I a 



M 



A 

/ 




A 
I 



4. From Betti diagrams to Markov degrees 
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for some i. In this case, we can pass to the quotient I a/ {xi) and study its properties 
instead. The foUowing aids us in performing a similar operation on Im'- 

Observation 4.1. Let A be a simplicial complex on [n] such that /o(A) < n (that is, 
there is some vertex that is not in A). Then Im contains a linear form. 

Proof. Without foss of generahty, suppose that {1} ^ A. Then, we may write M = 
[B ■ ■ ■ B]. Clearly, there is a linear form in ker M. For example, (1, 0, . . . , 0, — 1, 0, . . . , 0) G 
ker M, where the —1 is in position YYi=2 di + ^- n 

Initial degrees of generators of Im and Ja are related as follows. 

Proposition 4.2. Let k = init(/A)- Then init(/j\,/) = 2^"^ provided that di > 2 for 

1 < i < n. 

Proof. The proof is by induction on k. li k = 1, then there is some vertex not in A, and 
Observation 14. II implies that Im contains a linear form, therefore init(/j\f) = 1. Suppose 
k > 1. Let w be a vertex in the support of a minimal generator of Ja with degree k. 
Then, the Stanley-Reisner ideal of linkA(f), regarded as a complex over [n] — {v}, is 
{{I A - Xy) + Xv)/ {xv) C R/ [xv). Due to our choice of u, this ideal has initial degree 
k — 1. For convenience, we relabel the vertices of A so that v is labeled as vertex 1, 
noting that this does not change the initial degrees of I a or Im- 

By induction, I a has initial degree 2^^"^. Let u e ker A have minimal degree. Then 
{u, —u, 0, . . . , 0) G ker M has degree 2 ■ 2'^"^ = 2*^~^. We claim that this is the small- 
est degree of any element of ker M. Suppose, to the contrary, that there is some 
(ai, a2, . . . , ttdj e ker M with degree less than 2^~^. Then at least one of the a^'s must 
have degree less than 2^~^/di < 2^^'^ . Now a,/ G ker A provides a contradiction. □ 

Remark 4.3. Note that the size of the model in Proposition 14.21 is irrelevant to the 
initial degree of I a (and thus Im\ which is determined entirely by the combinatorial 
properties of A: 

init(/A) = min{|F| | F ^ A}. 

Proposition 14.21 provides two families of complexes for which the toric ideal Im has 
initial degree 2. The first family is the set of all 1-dimensional complexes A, except K^. 
Namely, since A is not the complete graph, there is some edge {«, j} ^ A, giving a degree 

2 generator XiXj of /a- Since A contains every vertex, init(/A) = 2, thus init(/jvf) = 2. 
(When A = Kn, the initial degrees are higher: init(/i^,J = 3 and init(/M) = 4.) The 
second family is that of disconnected complexes, whose Stanley-Reisner ideals also have 
initial degree 2: otherwise, their 1-skeletons would be which is connected. 

We can say a bit more about the generators of smallest degree in Im- 

Lemma 4.4. Let D = init Im- Then, if (mi, . . . , Ud^) G ker M with Ui G ker A has degree 
D, it is squarefree. 

Proof. Labeling the vertices properly so that xi is as in the proof of Proposition 14. 2[ we 
may assume that init(/yi) = D/2. Since the degree of any of the Uj's can not be smaller 
than D/2, at most 2 of the Uj's are non-zero: say, Uj =: u and Ui =: v. There are two 
cases: degu = degf = D/2, oi one of the vectors is while the other has degree D. In 
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the first case, each of u and v must be in the Markov basis for A. By induction on n, 
both are squarefree and thus the concatenation is also squarefree. 

In the second case, since the nonhnear generators of the Stanley-Reisner ideal of the 
deletion are also minimal generators of the Stanley-Reisner ideal of A, we see that having 
a degree D element in Ib forces init Is = D. The claim now follows by induction on the 
number of vertices. □ 

It is worth noting that one cannot conclude similarly for larger-degree generators of 
Im', they may or may not be squarefree. 

Remark 4.5. The results of Proposition 14.21 and Lemma [4.41 have been proved, using 
only linear algebra, in [221 Theorem 5]. 

Remark 4.6. The only case in which Im has initial degree 1 is when there is some 
vertex not contained in A. In this case, we may simply take quotients to pass to smaller 
polynomial rings, and thus assume that init (/a/) > 2. It then follows from Lemma [4.41 
that all degree 2 elements in any Markov basis are squarefree. The squarefree degree 2 
Markov moves are studied in [18] under the name "primitive moves" . 

4.2. Vertex-decomposable complexes. 

Definition 4.7. A simplicial complex A is vertex decomposable if A is pure and ei- 
ther A = {0}, or there is some vertex v such that A_^ and linkA(w) are both vertex 
decomposable. 

Requiring A to be vertex decomposable satisfies the assumptions of Lemma 13. 4[ and 
allows us to delete the vertices one at a time and use it inductively. Vertex decomposable 
complexes are shellable, and thus Cohen-Macaulay ([251 Proposition 16.42]). The class 
includes matroid complexes (that is, complexes whose restrictions are all pure), and 
all 1-dimensional, connected complexes. Most importantly, they provide a first (large) 
family of complexes satisfying the Conjecture. 

Theorem 4.8. Let A be a vertex decomposable complex. Suppose that the Betti diagram 
of the minimal free resolution of R/Ia has a non-zero entry in the j-th row (that is 
Pi^i+j{R/lA) 7^ for some i > 0). Then Im has a minimal generator with degree 2^ . 

Proof. If necessary, relabel the vertices so that A_i is vertex decomposable. By defini- 
tion, A is pure, thus A(_i) = A_i. If A is a cone with apex 1, the result is immediate 
by induction, since xi being a regular element implies that Ja and Ja_i have the same 
Betti diagram. Thus, we may assume that xi divides some minimal generator of Ja =: I- 
Then, multiplication by xi induces 

R/{I : xi)(-l) — R/I R/{I + xi) 0. 

The mapping cone construction (e.g., see [12]) provides that the i-th row in the Betti 
diagram of I must come from either the j-th row of the Betti diagram of /?/(/ + xi), or 
the (j — l)-th row of that of R/{I '■ Xi). Note that R/ (/-|-a;i) and R/ (/ : Xi) correspond to 
the matrices B and A, respectively (the second because A is vertex decomposable). Since 
the links and deletions of vertex decomposable complexes are again vertex decomposable, 
the induction hypothesis provides that the result holds for either I a or Ib- There are 
two cases. 
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Case I: Assume that S/ {I : Xi) has a non-zero entry in row j — 1 of its Betti diagram. 
By induction, has a degree 2^~^ minimal generator x'* — . Then, the 
vector (a, —a, 0, . . . , 0) G ker M has degree 2^ . Note that the binomial of this 
form is a minimal generator of Im- 
Case II: Now we assume that it is S/ (/a +2^1) whose Betti diagram has a non-zero entry 
in its j-th row. Let x^^ —x^ E Ib be the corresponding minimal generator of 
Ib with degree degx'' = 2^. We claim that keri? C ker A. Indeed, let T be a 
table in ker 5, and F a face in linkA(l). Since, by definition, linkA(l) C A_i, 
the F-margin of T being zero implies that T is also in ker A. 

Therefore, (6, 0, . . . , 0) G kerM where h := — b~ . The corresponding 
binomial is clearly a minimal generator, since it is minimal in Ib- 

□ 

A few remarks on the proof and possible generalizations of the result are in place. 
First, note that the use of the mapping cone construction requires information about 
links and deletions, making the class of vertex-decomposable complexes a natural family 
to apply it to. On the other hand, the technical observation of the kernel inclusion 
uses purity: it is a special case of the statement that if F C A is a subcomplex, then 
kerM(A) C kerM(F). In our case, we used that A_i = A<_i> and that link is a 
subcomplex of the deletion. 

In addition, note that the kernel inclusion used in Case II does not hold in general: 
Consider the complex with facets {1, 2} and {3}, for the binary model. Then the Markov 
basis of A consists of (0, 0, 1, —1) and (1, —1, 0, 0) while that for B has (0, 1, 0, —1) and 
(1, 0, —1, 0). Neither of the elements is in the Markov basis of B is in ker A. 

4.3. Constructing the predicted moves. Predicting Markov degrees is important 
for algorithms in algebraic statistics. On the other hand, it would be even better to 
predict actual Markov moves! In general, this is a nontrivial task; but for the case 
vertex-decomposable complexes, one can — in principle — follow the argument in the 
proof of Theorem 14.81 and construct the moves recursively, from links and deletions. Let 
us outline how a part of the procedure works for a binary hierarchical model (that is, 
for the case when all random variables have two states). We will consider the case when 
Ja has a generator of degree i, and construct the generator of Im of degree 2*~^. 

The results that follow are easiest to state in tableaux notation, which can be found, for 
example, in [TTl Chapter 1] . We consider the elements of the Markov basis as binomials 
in K[ph I 6 G {0, 1}"] and write the indices as the rows of two matrices. For example, 
PooioPoooo - PioioPoioi is written 









1 


0" 




'1 





1 


0" 



















1 





1 



The constructions used in the proof of Theorem 14.81 can be written in tableaux notation 
as [m] - [w] I-)- ^] - [? J^] (equivalent to a t-)- (a, -a)) and [u] - [v] [0 u] - [0 v] 
(equivalent to 6 t— )■ (&, 0)). Similarly, b i— )■ (0,6) is realized by replacing the column of 
O's in the second case by a column of I's. Now, given a minimal generator of /a with 
degree i, we explicitly construct a binomial with degree 2*~^, starting with the case 
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where i = 1. Throughout the remainder of this section we assume that A is vertex- 
decomposable and the vertices have been ordered so that deleting 1, 2, 3, . . . , n gives the 
needed decomposition, and that all the models are binary. 

Lemma 4.9. The binomial [u] — [v] is a linear generator of Im (here u and v are row 

vectors ) if and only if 

(1) Uj = Vj for every j ^ A 

(2) If j G A then Uj = and Uj ^ Vj for some j G A. 

Proof. The given tableaux can be in kerM if and only if the columns of M indexed 
by u and v are equal, which, by definition of M, is equivalent to the given conditions. 
(Alternatively, it can be shown by following the proof of Theorem 14.81 ) If A = {0} then 
this is Example 13.71 where u = (0, . . . , 0) and v is anything else. 

By relabeling the vertices, we may, without loss of generality, assume 1 G A. Then 
j ^ A if and only if j ^ A_i. By induction, Ib has a linear element [u'] — [v'], where 
u' and v' satisfy (1) and (2). Then [0 u'] - [0 v'] G Im and [1 u'] - [1 v'] G Im. The 
converse follows similarly. □ 



Before presenting a general method in Proposition 14.111 it is instructive to see an 
example. 

Example 4.10. We can construct some quadratic Markov moves for the complex with 
facets 

{{12}, {13}, {14}, {15}, {23}, {24}, {25}, {34}}. 

This has Stanley-Reisner ideal (2:42:5, 0:3X5, X1X2X5, X1X3X4, X2X3X4, X1X2X4, X1X2X3). Con- 
sider the degree 2 monomial 0:3X5. After deleting the vertices 1 and 2 we link 3 to get 
the complex A' = link(A_i)_2(3)- Since X3X5 G /a, 0:5 G /a'- By Lemma IT9] the binary 
Markov basis of A' contains the linear binomials [0 0] — [0 1] and [1 0] — [1 1] . Then (using 
the first binomial) we get that [lo?] — [100] is a minimal generator of the toric ideal 
for (A_i)_2 by applying Case I of Theorem 14.81 Now we lift back into A by appending 
columns of O's (repeatedly applying Case II) to the front of this and get that 




10 1 



1 
10 



is a minimal generator of Im- By altering this construction slightly, any of the first, 
second or fourth columns could be 1 instead of (but must be the same on both sides), 
giving us a total of 8 minimal generators. Starting with X4X5 and following the same 
procedure gives us the minimal Markov move 




1 1 



1 
1 



As before, the first, second and third columns can contain I's instead of O's, giving us 8 
more quadratic generators, for a total of 16. On the other hand, computing the Markov 
basis with 4ti2 gives that there are 24 quadratic moves in total. We will address this 
discrepancy shortly. 
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For a general recipe, we introduce notation to describe the columns of the tableaux 
that we can construct using the procedure of the preceding example. First, we define 
the alternating vectors by 

el 
ei 

I times 

= [4 ^ e]^ i 

so that £4 = [0 1 1]"^ while £3 = [0 1 1]"'". In general, e\. has alternating blocks of O's 
and I's; k is the length of the blocks and i the number of blocks. 

Additionally, we define column vectors by a\ = [0] and = 0^77]^, where a 
is the binary complement of a, defined by ttj := 1 — aj. Here, ai = [0 1]"^, ^2 = [0 1 1 0]"^, 
ttg = [0 1 1 1 1]^, and so on. 

Proposition 4.11. Let m = Xi-^ ■ ■ ■ Xi^ be a minimal generator of with ii < 12 < 
■ • ■ < id- Then [u] — [v] is a minimal generator of Im with degree 2'^~^ if, for j < d, 
the ij-th columns of u and v are e'^^-j, o,nd the id-th columns of u and v are ad and Od, 
respectively. The other columns of u and v are equal to each other and each is either 
[0 ■ ■ ■ 0]'^ or [1 ■ ■ ■ 1]-^. There are a total of 2""^^ such moves for each degree d minimal 
generator of I^. 

Proof. We induct on d. li d = 1 this is Lemma 14. 9[ so we assume that d > 1. If 
ii = 1 then consider linkA(l). Then minimal generator of Ia - Xi, which is 

(after modding out by Xi) the Stanley- Reisner ideal of the link. By induction we have 
a minimal generator of I a, [u'] — [v'], where the columns of u' and v' are as described. 
Then, as in Case I of Proposition 14. 8[ we have that 



u 




'0 v' 


1 V 




1 u 



is a minimal generator of Im- Note that, by definition, the first column of this matrix 
is e^a-i and the others or either 0, 1 or e^^^^ = [efa-j e'^d-jY ■ 

If Zi 7^ 1 then delete vertices 1,2,. — 1. Note that m is a minimal generator of 
/A + (a;i, . . . , Xj^-i), the Stanley-Reisner ideal of the deletion by the first ii — 1 generators. 
Then, by induction on the number of vertices, we have that the toric ideal of this deletion 
contains a minimal generator [u'] — [v'] with the columns as described. Now form the 
binomials [0 u'] — [0 v'] or [1 u'] — [1 v'] repeatedly lifting the binomials to the previous 
deletion, and eventually reaching Im- D 

Example 4.12. Using the same complex as in Example I4.10[ we use Proposition 14.111 
to construct a degree 4 minimal Markov move from the generator XiX^x^^ . The first 
column is £2 = [0 1 1]^, the third £2 = [0 1 !]• The the fourth columns of u and v 
are 02 = [0 1 1 0]^ and = [1 1]^. The other columns we may fill with O's or I's 



k times 




[4' 



-llT 
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as we like. We get 



along with 3 other similar binomials. Iterating over all the cubic generators of /a we 
construct a total of 20 degree 4 binomials, out of 520 that a minimal Markov basis 
contains. 



Remark 4.13. Each of the moves, [u] — [v], that Proposition 14.111 constructs from a 
degree d monomial has the property that all of the columns of u and v, except one, are 
equal, and all but d of the columns are either all O's or all I's. Tragically, not every 
degree 2'^~^ minimal Markov move can be constructed in this way. For the complex in 
Example 14.101 we also need moves of the form [ooii?] — [ooiio] along with 3 other 
moves obtained by replacing either of the first two columns with [1 1]-^ on both sides. 
The other 4 (which, together with the 16 from Example 14.101 give us all 24 quadratic 
elements in the Markov basis) are obtained from [ooioo]~[ooioi]- To study these 
we may want to look at the resolution, which has Betti diagram 








1 


2 


3 


total: 


1 


7 


10 


4 


0: 


1 








1: 




2 


1 




2: 




5 


9 


4 



Moreover, the lone linear syzygy of the 2 quadratic generators has multidegree (0, 0, 1, 1, 1) 
making us suspicious that this is the source of the quadratic moves missed by Proposi- 
tion SiTll Even in the smallest degrees the minimal generators of /a do not give enough 
information to construct all of the minimal Markov moves. 

The constructions described here can, in principle, be extended to construct a Markov 
move from a shift in the multi-graded resolution. This should give a result analogous 
to Proposition 14.111 however, as it would involve studying multi-graded shifts in the 
mapping cone construction, it is beyond the scope of this paper. 

4.4. Two more cases. We can settle two more special cases of Conjecture 11.11 

Proposition 4.14. If A is any simplicial complex and Pi,i+j{R/ I/i) 7^ for j < 2 and 

some i > 0, then Im has a degree 2^ minimal generator. 

Before presenting a proof, we need a technical result: 

Lemma 4.15. If H^{A] K) ^ for some j < 1, then there is some a C [n] and f G A 
such that H^~\\mkA{v)\a; K) ^0. 

Proof. Indeed, since j < 1 we may, without loss of generality, assume that dim A < 2. 
If j = 0, then A is disconnected, and so for every G A, there exists a w G A with 
w ^ linkA(f). If we let a := {w \ w ^ linkA(f)}, then linkA(t')|<j = {0}- Therefore, 



14 



SONJA PETROVIC AND ERIK STOKES 



If j = 1, then we may assume that dim(hnkA(f)) = 1. Let a be a cycle in A that is 
not a boundary, and v G cr. If cr is a 3-cycle, then linkA(t')|o- is 0-dimensionaI. If ex = 
{f , ai, 02, ... , Cfc}, then linkA(tO|o- is contained in the path {ai, 02, ... , Cfc}. Removing 
any vertex will give a disconnected complex, which then has non-zero 0-th cohomology, 
completing the proof. □ 



Proof of Proposition \4-14\ We need to show that R/ {I a '■ Xi) has a non-zero entry in row 



j — 1 of its Betti diagram. Using Hochster's formula, we know that ^(^k; K) 7^ for 
some r C [n]. Applying Lemma [4.151 to gives a vertex f e r and a face a C r with 
H^~^{\mkA{v)\a', K) 7^ 0. Hochster's formula then implies that [3ii^ii+j-iR/{lA ■ x^) 7^ 0. 
The remainder of the proof is identical to Case I of Theorem 14.81 □ 

The second special case handled is Gorenstein complexes of low dimension. We call 
a simplicial complex Gorenstein (over K) if /a is Cohen-Macaulay with codimension c 
and its last total Betti number, /3c, is 1. By [H Theorem 5.6.1] this is equivalent to A 
being a homological sphere, that is 



dimK Hi{\mkA{F)- K) 



i ^ dimlinkA(-F) 

1 i = dimlinkA(-F) 



In particular, any complex whose geometric realization is a sphere (for example, the 
octahedron, or a cycle) is Gorenstein. 

Proposition 4.16. //dim A = 2 and A is Gorenstein, then Im has a degree 2^ minimal 
generator whenever Pi,i+j{R/ Ia) 7^ for some i > 0. 

Proof. If j < 2, then this follows from Proposition 14.141 Since dim A < 2, there are no 
non-zero Betti numbers with j > 3, so we only need to verify the case when j = 3. We 
may assume that A is not a cone. From Equation ([T]) we know that i/^(linkA(tO! ^) — ^ 
for every u e A. Let a be the set of vertices in linkA(w). Then i/^(linkA(f )|(t; K) 7^ 0. 
Applying Hochster's formula and referring to Case I of Theorem 14.81 completes the 
proof. □ 

Remark 4.17. Another case of Conjecture 11.11 could be deduced, with a bit of effort, 
from the results in [23], by studying the circuits of the marginal matrix (see [22]). This 
would prove a weaker version: that the Stanley- Reisner ideal having a degree j minimal 
generator implies that Im has a generator of degree 2-'"^. Note that minimal generators 
are summarized by the first Betti numbers (column 1 of the Betti diagram). 

4.5. Moves not predicted by syzygies. We close this section by showing that the 
Conjecture does not predict all Markov moves for the hierarchical model. Even Theo- 
rem 14.81 only occasionally gives us an exact list of the degrees in the Markov basis, as 
the following example shows. 

Example 4.18. Contrast Example 11.21 with the complex whose Stanley-Reisner ideal is 

J = (x3X4X5,XiX2X5,XiX4X5,XiX2X3) , 

the Alexander dual of a 5-path. The Betti diagram over any field is 
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1 2 
total: 1 4 3 
0: 1 . . 
1: . . . 
2: . 4 3 

Notice that only a degree 4 generator is predicted for I^. Yet the binary model over 
this complex has generators in degrees 4, 6, 8, 10 and 12. 

5. Decomposable graphical models 

Even though the converse of Theorem 14.81 does not hold, its contrapositive can be 
used to obtain information about the Stanley-Reisner ideal from the Markov basis. For 
example, if the Markov basis contains elements of only 3 distinct degrees, then the Betti 
diagram of /a can contain at most 3 non-zero rows. In fact, it is likely that there 
will be fewer then 3, since Theorem 14.81 only includes the Markov basis elements whose 
degree is a power of 2. As a special case, the result provides a class of complexes whose 
Stanley-Reisner ideals have linear resolutions. 

Corollary 5.1. //A is vertex decomposable and Im is generated in a single degree, then 
Ja has a linear resolution over every field. 

One of the few cases where the generators of the toric ideal of the hierarchical model 
are worked out is the class of decomposable complexes, that is, clique complexes of 
chordal graphs. 

Definition 5.2. A complex A is decomposable if it is a simplex or has sub-complexes 
such that A = Ai U A2 where Ai fl A2 consists of a single facet (called the separator) 
and Ai, A2 are both decomposable. 

That these toric ideals are generated in degree 2 is a consequence of Theorem 4.4 in 
|15] . which characterizes decomposable models. 

Remark 5.3. Note that the term decomposable in the setting of [T3] refers to those 
undirected graphical models whose graphs are decomposable. A graph is called decom- 
posable if its clique complex is a decomposable complex. (See Section 3.3. of [11] for 
further discussion, and [9] for an earlier study of decomposable graphs.) Decomposable 
graphical models are not to be confused with graphs that are decomposable as simplicial 
complexes. 

Despite the similarity in the names, decomposable complexes are unrelated to vertex- 
decomposable complexes. 

Example 5.4. Let G be the graph with edges {1,2}, {2,3}, {1,3}, {2,4} and {3,5}. 
The clique complex A of G has facets {1,2,3}, {2,4} and {3,5}. It is decomposable, 
but not vertex decomposable, since it is not even pure. 

Proposition 5.5. Suppose A = Ai U A2 and Ai Ci A2 = S is a simplex or (that is, A 
is reducible // Conjecture holds for Ai, then it holds for A. 

In particular, it holds for decomposable complexes. More precisely, if A is decompos- 
able and (3i^i^j{R/lA) 7^ for some i > 0, then Im has a degree 2^ minimal generator. 
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Proof. On the toric ideal side, the toric ideal of A is a toric fiber product with separator 
S (see Theorem 3.5 in [30]). More precisely, the toric ideal of the union contains the 
toric ideals of the components along with a set of degree 2 generators. 

Suppose that Pi^i+jS/I/^ ^ 0. Then, by Hochster's formula, there is some a C [n] 
such that if-'~^(A|o-; K) ^ 0. Let (Tj be the part of a contained in Aj. Using the Mayer- 
Vietoris sequence ([T71 page 203]) we get that, since A|o- = Ailo-^ U ^2\c72 and Ai fl A2|o- 
is a simplex and thus has no cohomology, we can compute 

if j > 1. If the cohomology of A|o- is non-zero, then the cohomology of one of the Ajlo-^ 
is also non-zero. Then Hochster's formula and induction gives a degree 2^ generator in 
the Markov basis of one of the Aj. Then, the facts about toric fiber products above 
show that Im also has a degree 2^ generator. 

To prove the second claim, write A = Ai U A2 with both Aj decomposable. By 
induction. Conjecture 11.11 holds for Ai and A2. Since they intersect at a simplex, the 
result follows. □ 

Since it is known that the toric ideal of a decomposable model is generated in degree 
2, we get the following, combinatorial result: 

Corollary 5.6. //A is decomposable then has a 2-linear resolution over every field. 

Remark 5.7. The result above recovers a theorem of Froberg. Namely, clique complexes 
of chordal graphs are examples of fiag complexes. In [13], Froberg characterized the 
Stanley-Reisner rings of fiag complexes that have a linear resolution: they are exactly 
clique complexes of chordal graphs. His proof uses Mayer- Vietoris sequences and does 
not give a combinatorial formula for the Betti numbers. Recently, an easier proof of this 
proposition, along with a formula for the Betti numbers, appeared in [TOl Theorem 3.2.]. 



6. Geometric interpretation 

Geometrically, the Betti diagram is determined by the reduced simplicial cohomology 
of the complex and its sub-complexes via Hochster's formula (see Section [2]). 

Corollary 6.1. //A is vertex decomposable and H^{A;K) ^ for some j > and 
some field K , then Im has a degree 2^^^ minimal generator. 

Proof. Taking quotients if necessary, we may assume that /o(A) = n. Hoschter's formula 
says that if we have a vector a G {0, 1}" and A = supp(a), then we can compute the 
Z"-graded Betti numbers by 

dimH\^\-^-\AU;K)=(3^,{R/I^). 

Let A = [n], the only set such that A\a = A. Since all the Betti numbers of Ja are 
squarefree and (1, . . . , 1) is the only squarefree integer vector with sum n, we get that 
7^ dim if-' (A; K) = Pn-j-i,n{R/ ^a)- Therefore, the Betti diagram has a non-zero entry 
in row n — {n — j — 1) = j + 1. □ 
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Since vertex decomposable complexes are Cohen-Macaulay, at most one of the colio- 
mology groups is non-zero. We have stated the corollary in full generality because a proof 
of Conjecture [TTT] for a wider class of complexes would immediately make Corollary 16.11 
much more interesting. 

Example 6.2. Corollary 16.11 can be applied to all connected graphs since they are all 
vertex-decomposable. For example, if A is a cycle, or any graph containing a cycle as 
a vertex-induced subgraph, then Im must have a degree 4 element in its Markov basis, 
because A has non-zero first cohomology. 

Similarly, if A is a vertex decomposable simplicial /c-sphere, then H''{A;K) = K. 
Hence the Markov basis must contain a degree 2^^^ element. For example, any Markov 
basis of a hierarchical model associated to an octahedron contains at least one move 
with degree 8. 

Remark 6.3. For graphs, it has recently been shown in [21] that the hierarchical model 
is generated in degrees 2 and 4 if and only if the graph has no /r4-minors. If dim A = 1 
then, using Hochster's formula, Pi^i+jR/lA = whenever j > 2, so our theorem can 
only predict generators in degree 2 and 4. If A has a ii'4-minor then the dimension of 
H^{A] K) is at least that of H^{K4] K), which is 3. Assuming that A is connected, this 
means our theorem can give an exact listing of generators only if the rightmost entry in 
the Betti diagram is at most 2. This fails in dimension 2. 

7. Computations 

We conclude this note by a computational answer to a natural question motivated 
by Examples 14.101 14.121 and Remark 14.131 how much do the predictions using the 
constructions above differ from the actual Markov bases? 

The first table below summarizes the results of computer experiments in which we 
used 4ti2 to compute the Markov bases and compared them to the resolutions obtained 
using Macaulay2. All resolutions were computed over the rationals. The rows in the 
table give the number of vertices in the complex, the columns represent the number of 
degrees not predicted by Conjecture 11.11 The (i,j)-entry in the table is the number of 
complexes on i vertices for which j Markov degrees are not be predicted by our results. 





number of degrees not 


predicted 


n 





1 


2 


3 


4 


5 


6 


7 


3 


18 


1 














4 


44 


8 














5 


17 


3 


9 


24 


8 








6 


33 


4 


1 


5 






2 


2 


7 


1 
















8 


1 
















total 


114 


16 


10 


29 


8 




2 


2 



The second table summarizes the results for all complexes with 4 vertices (we leave 
out the 3-simplex as its Markov basis is trivial). All of the complexes except the last 
two fall under either Theorem 14.81 or Proposition 15.51 
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binary Markov 


predicted 


facets 


degrees 


degrees 


1,2,3,4 


2 


2 


12,3,4 


2 


2 


12,23,4 


2 


2 


123,4 


2 


2 


12,23,34 


2 


2 


12,23,34,41 


2, 4 


2, 4 


12,23,31,34 


2, 4 


2, 4 


12,23,31,4 


2, 4 


2, 4 


12,23,34,41,13 


2, 4 


2, 4 


12,13,14,23,24,34 


4, 6 


4 


123,34 


2 


2 


123,24,34 


2,4 


2,4 


123,234 


2 


2 


123,124,134,234 


8 


4 


123,24,34,14 


4, 6 


4 


123,234,14 


4 


4 



Notice that only two complexes (indicated in bold) contain generators in degrees not 
predicted by our conjecture. Interestingly, the first is the complete graph Ka,^ which, by 
|24] . must have a generator with degree larger than 4. 
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